Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 936 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 258.5 KiB |
| Average record size in memory | 282.8 B |
Variable types
| NUM | 8 |
|---|---|
| CAT | 3 |
Reproduction
| Analysis started | 2020-05-10 12:12:52.836389 |
|---|---|
| Analysis finished | 2020-05-10 12:13:27.048553 |
| Version | pandas-profiling v2.6.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| Warning | Type |
|---|---|
| Title has a high cardinality: 935 distinct values | High cardinality |
| Genre has a high cardinality: 200 distinct values | High cardinality |
| Director has a high cardinality: 607 distinct values | High cardinality |
| Rank is highly correlated with df_index | High correlation |
| df_index is highly correlated with Rank | High correlation |
df_index
Real number (ℝ≥0)
| Distinct count | 936 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 497.18589743589746 |
| Minimum | 0 |
| Maximum | 999 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 52.75 |
| Q1 | 245.75 |
| median | 495.5 |
| Q3 | 745.25 |
| 95-th percentile | 945.5 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 499.5 |
Descriptive statistics
| Standard deviation | 288.1005611 |
|---|---|
| Coefficient of variation (CV) | 0.5794624558 |
| Kurtosis | -1.202928275 |
| Mean | 497.1858974 |
| Median Absolute Deviation (MAD) | 249.3747261 |
| Skewness | 0.009336325751 |
| Sum | 465366 |
| Variance | 83001.93332 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 999.], "bayesian blocks" binning strategy used)

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 999 | 1 | 0.1% | |
| 326 | 1 | 0.1% | |
| 339 | 1 | 0.1% | |
| 338 | 1 | 0.1% | |
| 337 | 1 | 0.1% | |
| 336 | 1 | 0.1% | |
| 334 | 1 | 0.1% | |
| 333 | 1 | 0.1% | |
| 332 | 1 | 0.1% | |
| 331 | 1 | 0.1% | |
| Other values (926) | 926 | 98.9% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1 | 0.1% | |
| 1 | 1 | 0.1% | |
| 2 | 1 | 0.1% | |
| 3 | 1 | 0.1% | |
| 4 | 1 | 0.1% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 999 | 1 | 0.1% | |
| 998 | 1 | 0.1% | |
| 997 | 1 | 0.1% | |
| 996 | 1 | 0.1% | |
| 995 | 1 | 0.1% |
Rank
Real number (ℝ≥0)
| Distinct count | 936 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 498.18589743589746 |
| Minimum | 1 |
| Maximum | 1000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 53.75 |
| Q1 | 246.75 |
| median | 496.5 |
| Q3 | 746.25 |
| 95-th percentile | 946.5 |
| Maximum | 1000 |
| Range | 999 |
| Interquartile range (IQR) | 499.5 |
Descriptive statistics
| Standard deviation | 288.1005611 |
|---|---|
| Coefficient of variation (CV) | 0.5782993108 |
| Kurtosis | -1.202928275 |
| Mean | 498.1858974 |
| Median Absolute Deviation (MAD) | 249.3747261 |
| Skewness | 0.009336325751 |
| Sum | 466302 |
| Variance | 83001.93332 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1000.], "bayesian blocks" binning strategy used)

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1000 | 1 | 0.1% | |
| 327 | 1 | 0.1% | |
| 340 | 1 | 0.1% | |
| 339 | 1 | 0.1% | |
| 338 | 1 | 0.1% | |
| 337 | 1 | 0.1% | |
| 335 | 1 | 0.1% | |
| 334 | 1 | 0.1% | |
| 333 | 1 | 0.1% | |
| 332 | 1 | 0.1% | |
| Other values (926) | 926 | 98.9% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 1 | 0.1% | |
| 2 | 1 | 0.1% | |
| 3 | 1 | 0.1% | |
| 4 | 1 | 0.1% | |
| 5 | 1 | 0.1% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1000 | 1 | 0.1% | |
| 999 | 1 | 0.1% | |
| 998 | 1 | 0.1% | |
| 997 | 1 | 0.1% | |
| 996 | 1 | 0.1% |
Title
Categorical
| Distinct count | 935 |
|---|---|
| Unique (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 KiB |
| The Host | 2 |
|---|---|
| Divergent | 1 |
| Godzilla | 1 |
| Juno | 1 |
| Resident Evil: Retribution | 1 |
| Other values (930) |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| The Host | 2 | 0.2% | |
| Divergent | 1 | 0.1% | |
| Godzilla | 1 | 0.1% | |
| Juno | 1 | 0.1% | |
| Resident Evil: Retribution | 1 | 0.1% | |
| 22 Jump Street | 1 | 0.1% | |
| A Kind of Murder | 1 | 0.1% | |
| Easy A | 1 | 0.1% | |
| Oblivion | 1 | 0.1% | |
| Step Up 2: The Streets | 1 | 0.1% | |
| Other values (925) | 925 | 98.8% |
Length
| Max length | 61 |
|---|---|
| Mean length | 14.65384615 |
| Min length | 2 |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| Lowercase_Letter | 31 | 39.2% | |
| Uppercase_Letter | 26 | 32.9% | |
| Decimal_Number | 10 | 12.7% | |
| Other_Punctuation | 8 | 10.1% | |
| Close_Punctuation | 1 | 1.3% | |
| Space_Separator | 1 | 1.3% | |
| Dash_Punctuation | 1 | 1.3% | |
| Open_Punctuation | 1 | 1.3% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| Latin | 57 | 72.2% | |
| Common | 22 | 27.8% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| ASCII | 74 | 100.0% |
Genre
Categorical
| Distinct count | 200 |
|---|---|
| Unique (%) | 21.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 KiB |
| Action,Adventure,Sci-Fi | 50 |
|---|---|
| Drama | 43 |
| Comedy,Drama,Romance | 32 |
| Comedy | 30 |
| Drama,Romance | 28 |
| Other values (195) |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| Action,Adventure,Sci-Fi | 50 | 5.3% | |
| Drama | 43 | 4.6% | |
| Comedy,Drama,Romance | 32 | 3.4% | |
| Comedy | 30 | 3.2% | |
| Drama,Romance | 28 | 3.0% | |
| Animation,Adventure,Comedy | 26 | 2.8% | |
| Action,Adventure,Fantasy | 26 | 2.8% | |
| Comedy,Drama | 25 | 2.7% | |
| Comedy,Romance | 25 | 2.7% | |
| Crime,Drama,Mystery | 22 | 2.4% | |
| Other values (190) | 629 | 67.2% |
Length
| Max length | 26 |
|---|---|
| Mean length | 18.20512821 |
| Min length | 5 |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| Lowercase_Letter | 18 | 58.1% | |
| Uppercase_Letter | 11 | 35.5% | |
| Dash_Punctuation | 1 | 3.2% | |
| Other_Punctuation | 1 | 3.2% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| Latin | 29 | 93.5% | |
| Common | 2 | 6.5% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| ASCII | 31 | 100.0% |
Director
Categorical
| Distinct count | 607 |
|---|---|
| Unique (%) | 64.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 KiB |
| Ridley Scott | 8 |
|---|---|
| M. Night Shyamalan | 6 |
| Michael Bay | 6 |
| David Yates | 6 |
| Paul W.S. Anderson | 6 |
| Other values (602) |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| Ridley Scott | 8 | 0.9% | |
| M. Night Shyamalan | 6 | 0.6% | |
| Michael Bay | 6 | 0.6% | |
| David Yates | 6 | 0.6% | |
| Paul W.S. Anderson | 6 | 0.6% | |
| Peter Berg | 5 | 0.5% | |
| Justin Lin | 5 | 0.5% | |
| Danny Boyle | 5 | 0.5% | |
| Denis Villeneuve | 5 | 0.5% | |
| Antoine Fuqua | 5 | 0.5% | |
| Other values (597) | 879 | 93.9% |
Length
| Max length | 32 |
|---|---|
| Mean length | 13.13782051 |
| Min length | 3 |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| Lowercase_Letter | 38 | 55.1% | |
| Uppercase_Letter | 27 | 39.1% | |
| Other_Punctuation | 2 | 2.9% | |
| Dash_Punctuation | 1 | 1.4% | |
| Space_Separator | 1 | 1.4% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| Latin | 65 | 94.2% | |
| Common | 4 | 5.8% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| ASCII | 56 | 100.0% |
Year
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2012.7713675213674 |
| Minimum | 2006 |
| Maximum | 2016 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 2006 |
|---|---|
| 5-th percentile | 2007 |
| Q1 | 2010 |
| median | 2014 |
| Q3 | 2016 |
| 95-th percentile | 2016 |
| Maximum | 2016 |
| Range | 10 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.178987268 |
|---|---|
| Coefficient of variation (CV) | 0.001579408034 |
| Kurtosis | -0.8070367081 |
| Mean | 2012.771368 |
| Median Absolute Deviation (MAD) | 2.726020893 |
| Skewness | -0.6863119763 |
| Sum | 1883954 |
| Variance | 10.10596005 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2006. 2012.5 2015.5 2016. ], "bayesian blocks" binning strategy used)

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2016 | 268 | 28.6% | |
| 2015 | 123 | 13.1% | |
| 2014 | 95 | 10.1% | |
| 2013 | 86 | 9.2% | |
| 2012 | 62 | 6.6% | |
| 2010 | 59 | 6.3% | |
| 2011 | 58 | 6.2% | |
| 2009 | 49 | 5.2% | |
| 2008 | 49 | 5.2% | |
| 2007 | 46 | 4.9% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2006 | 41 | 4.4% | |
| 2007 | 46 | 4.9% | |
| 2008 | 49 | 5.2% | |
| 2009 | 49 | 5.2% | |
| 2010 | 59 | 6.3% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2016 | 268 | 28.6% | |
| 2015 | 123 | 13.1% | |
| 2014 | 95 | 10.1% | |
| 2013 | 86 | 9.2% | |
| 2012 | 62 | 6.6% |
Runtime
Real number (ℝ≥0)
| Distinct count | 92 |
|---|---|
| Unique (%) | 9.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 113.2724358974359 |
| Minimum | 66 |
| Maximum | 187 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 66 |
|---|---|
| 5-th percentile | 88 |
| Q1 | 100 |
| median | 111 |
| Q3 | 123 |
| 95-th percentile | 149 |
| Maximum | 187 |
| Range | 121 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 18.55079827 |
|---|---|
| Coefficient of variation (CV) | 0.1637715135 |
| Kurtosis | 0.6336593054 |
| Mean | 113.2724359 |
| Median Absolute Deviation (MAD) | 14.55930857 |
| Skewness | 0.7911194262 |
| Sum | 106023 |
| Variance | 344.1321164 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 66. 80.5 84.5 91.5 120.5 133.5 144.5 165.5 187. ], "bayesian blocks" binning strategy used)

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 108 | 29 | 3.1% | |
| 117 | 26 | 2.8% | |
| 100 | 26 | 2.8% | |
| 110 | 25 | 2.7% | |
| 118 | 25 | 2.7% | |
| 102 | 25 | 2.7% | |
| 106 | 24 | 2.6% | |
| 104 | 22 | 2.4% | |
| 112 | 22 | 2.4% | |
| 101 | 21 | 2.2% | |
| Other values (82) | 691 | 73.8% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 66 | 1 | 0.1% | |
| 73 | 1 | 0.1% | |
| 80 | 2 | 0.2% | |
| 81 | 4 | 0.4% | |
| 82 | 1 | 0.1% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 187 | 1 | 0.1% | |
| 180 | 2 | 0.2% | |
| 172 | 1 | 0.1% | |
| 170 | 1 | 0.1% | |
| 169 | 3 | 0.3% |
Rating
Real number (ℝ≥0)
| Distinct count | 55 |
|---|---|
| Unique (%) | 5.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.729166666666667 |
| Minimum | 1.9 |
| Maximum | 9.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 1.9 |
|---|---|
| 5-th percentile | 5.175 |
| Q1 | 6.2 |
| median | 6.8 |
| Q3 | 7.4 |
| 95-th percentile | 8.1 |
| Maximum | 9 |
| Range | 7.1 |
| Interquartile range (IQR) | 1.2 |
Descriptive statistics
| Standard deviation | 0.9352249579 |
|---|---|
| Coefficient of variation (CV) | 0.1389807987 |
| Kurtosis | 1.190310556 |
| Mean | 6.729166667 |
| Median Absolute Deviation (MAD) | 0.7355947293 |
| Skewness | -0.7045209798 |
| Sum | 6298.5 |
| Variance | 0.8746457219 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.9 3.8 4.55 5.15 5.65 6.15 7.35 8.15 8.55 9. ], "bayesian blocks" binning strategy used)

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 6.7 | 47 | 5.0% | |
| 7 | 44 | 4.7% | |
| 7.1 | 44 | 4.7% | |
| 6.3 | 41 | 4.4% | |
| 7.3 | 40 | 4.3% | |
| 7.8 | 39 | 4.2% | |
| 6.6 | 39 | 4.2% | |
| 7.2 | 39 | 4.2% | |
| 6.5 | 37 | 4.0% | |
| 6.2 | 36 | 3.8% | |
| Other values (45) | 530 | 56.6% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1.9 | 1 | 0.1% | |
| 2.7 | 1 | 0.1% | |
| 3.2 | 1 | 0.1% | |
| 3.5 | 2 | 0.2% | |
| 3.7 | 1 | 0.1% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 9 | 1 | 0.1% | |
| 8.8 | 1 | 0.1% | |
| 8.6 | 3 | 0.3% | |
| 8.5 | 6 | 0.6% | |
| 8.4 | 3 | 0.3% |
Votes
Real number (ℝ≥0)
| Distinct count | 933 |
|---|---|
| Unique (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 175270.21688034188 |
| Minimum | 61 |
| Maximum | 1791916 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 61 |
|---|---|
| 5-th percentile | 1586.25 |
| Q1 | 41593 |
| median | 114918.5 |
| Q3 | 249538 |
| 95-th percentile | 530938.75 |
| Maximum | 1791916 |
| Range | 1791855 |
| Interquartile range (IQR) | 207945 |
Descriptive statistics
| Standard deviation | 190582.4207 |
|---|---|
| Coefficient of variation (CV) | 1.08736341 |
| Kurtosis | 11.27174861 |
| Mean | 175270.2169 |
| Median Absolute Deviation (MAD) | 137161.3218 |
| Skewness | 2.493379996 |
| Sum | 164052923 |
| Variance | 3.632165907e+10 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[6.1000000e+01 3.8450000e+02 2.4555000e+03 9.2660000e+03 1.1611500e+05 2.2172900e+05 3.5732450e+05 5.9080900e+05 1.0466675e+06 1.7919160e+06], "bayesian blocks" binning strategy used)

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 291 | 2 | 0.2% | |
| 97141 | 2 | 0.2% | |
| 1427 | 2 | 0.2% | |
| 125693 | 1 | 0.1% | |
| 406219 | 1 | 0.1% | |
| 299718 | 1 | 0.1% | |
| 461509 | 1 | 0.1% | |
| 92868 | 1 | 0.1% | |
| 240323 | 1 | 0.1% | |
| 101058 | 1 | 0.1% | |
| Other values (923) | 923 | 98.6% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 61 | 1 | 0.1% | |
| 102 | 1 | 0.1% | |
| 115 | 1 | 0.1% | |
| 164 | 1 | 0.1% | |
| 173 | 1 | 0.1% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1791916 | 1 | 0.1% | |
| 1583625 | 1 | 0.1% | |
| 1222645 | 1 | 0.1% | |
| 1047747 | 1 | 0.1% | |
| 1045588 | 1 | 0.1% |
Revenue
Real number (ℝ≥0)
| Distinct count | 789 |
|---|---|
| Unique (%) | 84.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 80.78591346153847 |
| Minimum | 0.01 |
| Maximum | 936.63 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.3275 |
| Q1 | 17.5225 |
| median | 47.985 |
| Q3 | 102.4225 |
| 95-th percentile | 292.075 |
| Maximum | 936.63 |
| Range | 936.62 |
| Interquartile range (IQR) | 84.9 |
Descriptive statistics
| Standard deviation | 99.49466277 |
|---|---|
| Coefficient of variation (CV) | 1.231584301 |
| Kurtosis | 11.92658652 |
| Mean | 80.78591346 |
| Median Absolute Deviation (MAD) | 68.18833642 |
| Skewness | 2.76636849 |
| Sum | 75615.615 |
| Variance | 9899.187921 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.00000e-02 5.50000e-02 3.35000e-01 4.30500e+00 4.79675e+01 ... 1.03085e+02 1.70370e+02 2.60890e+02 4.23840e+02 9.36630e+02], "bayesian blocks" binning strategy used)

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 47.985 | 99 | 10.6% | |
| 0.03 | 5 | 0.5% | |
| 0.04 | 4 | 0.4% | |
| 0.32 | 4 | 0.4% | |
| 0.05 | 4 | 0.4% | |
| 0.02 | 4 | 0.4% | |
| 0.01 | 4 | 0.4% | |
| 0.54 | 3 | 0.3% | |
| 2.2 | 3 | 0.3% | |
| 0.15 | 3 | 0.3% | |
| Other values (779) | 803 | 85.8% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.01 | 4 | 0.4% | |
| 0.02 | 4 | 0.4% | |
| 0.03 | 5 | 0.5% | |
| 0.04 | 4 | 0.4% | |
| 0.05 | 4 | 0.4% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 936.63 | 1 | 0.1% | |
| 760.51 | 1 | 0.1% | |
| 652.18 | 1 | 0.1% | |
| 623.28 | 1 | 0.1% | |
| 533.32 | 1 | 0.1% |
Metascore
Real number (ℝ≥0)
| Distinct count | 84 |
|---|---|
| Unique (%) | 9.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 58.98504273504273 |
| Minimum | 11.0 |
| Maximum | 100.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.4 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 31 |
| Q1 | 47 |
| median | 59.5 |
| Q3 | 72 |
| 95-th percentile | 85 |
| Maximum | 100 |
| Range | 89 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 17.19475702 |
|---|---|
| Coefficient of variation (CV) | 0.2915104614 |
| Kurtosis | -0.6122051468 |
| Mean | 58.98504274 |
| Median Absolute Deviation (MAD) | 14.21000895 |
| Skewness | -0.1238873467 |
| Sum | 55210 |
| Variance | 295.6596691 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 11. 21. 29.5 45.5 83.5 88.5 100. ], "bayesian blocks" binning strategy used)

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 66 | 25 | 2.7% | |
| 72 | 25 | 2.7% | |
| 68 | 25 | 2.7% | |
| 64 | 24 | 2.6% | |
| 57 | 23 | 2.5% | |
| 51 | 22 | 2.4% | |
| 65 | 22 | 2.4% | |
| 48 | 21 | 2.2% | |
| 81 | 21 | 2.2% | |
| 76 | 21 | 2.2% | |
| Other values (74) | 707 | 75.5% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 11 | 1 | 0.1% | |
| 15 | 1 | 0.1% | |
| 16 | 1 | 0.1% | |
| 18 | 4 | 0.4% | |
| 19 | 1 | 0.1% |

| Value | Count | Frequency (%) | |
|---|---|---|---|
| 100 | 1 | 0.1% | |
| 99 | 1 | 0.1% | |
| 98 | 1 | 0.1% | |
| 96 | 4 | 0.4% | |
| 95 | 3 | 0.3% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for a linear relationship only the sign of the slope, not its steepness, affects r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
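The calculation described above can be sketched in plain Python. This is a minimal illustration, not the code the report itself runs: the function name `pearson_r` is hypothetical, and the sample values are the `df_index` and `Rank` entries of the first five rows listed under "First rows".

```python
from math import sqrt


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations.

    The 1/n factors of the covariance and of the two standard deviations
    cancel, so plain sums of deviations are used instead.
    """
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    # Raises ZeroDivisionError if either variable is constant (r is undefined).
    return cov / sqrt(var_x * var_y)


# First five rows of the report: Rank is df_index shifted by one.
df_index = [0, 1, 2, 3, 4]
rank = [1, 2, 3, 4, 5]
print(pearson_r(df_index, rank))  # 1.0

# Shifting and positively rescaling one variable leaves r unchanged.
print(pearson_r([2 * x + 7 for x in df_index], rank))  # 1.0
```

Because `Rank` is a shifted copy of `df_index` in these rows, the result is a perfect positive correlation, which is consistent with the high-correlation warning the report raises for this pair.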
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
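As a rough sketch of that recipe, the snippet below ranks each variable and then applies the Pearson formula to the ranks. The helper names `average_ranks`, `pearson_r`, and `spearman_rho` are hypothetical, and giving tied values the average of their ranks is an assumption (a common convention, not something the report states). The sample values are the `Rating` and `Votes` entries of the first five rows, so the result describes only those rows, not the full dataset.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables of xs and ys."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    return pearson_r(average_ranks(xs), average_ranks(ys))


# Rating and Votes from the first five rows of the report.
rating = [8.1, 7.0, 7.3, 7.2, 6.2]
votes = [757074, 485820, 157606, 60545, 393727]
print(spearman_rho(rating, votes))

# A nonlinear but strictly increasing relationship is still perfectly monotonic.
xs = [1, 2, 3, 4, 5]
print(spearman_rho(xs, [x ** 3 for x in xs]))
```

The second call shows why ρ catches monotonic relationships that are not linear: cubing preserves the ordering, so the ranks of the two variables are identical.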
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
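The pair-counting definition above corresponds to the variant usually called τ-a. A minimal sketch follows, with the hypothetical function name `kendall_tau_a`. Statistical libraries often default to a tie-corrected variant, so their results can differ when the data contain ties. The sample values are again `Rating` and `Votes` from the first five rows.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x2 - x1) * (y2 - y1)
        if direction > 0:
            concordant += 1  # both variables move in the same direction
        elif direction < 0:
            discordant += 1  # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it counts for neither side
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Rating and Votes from the first five rows of the report.
rating = [8.1, 7.0, 7.3, 7.2, 6.2]
votes = [757074, 485820, 157606, 60545, 393727]
print(kendall_tau_a(rating, votes))
```

Comparing every pair costs O(n²) time, which is fine for the 936 observations in this dataset but is the reason library implementations use faster sorting-based algorithms.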
First rows
| df_index | Rank | Title | Genre | Director | Year | Runtime | Rating | Votes | Revenue | Metascore | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1 | Guardians of the Galaxy | Action,Adventure,Sci-Fi | James Gunn | 2014 | 121 | 8.1 | 757074 | 333.130 | 76.0 |
| 1 | 1 | 2 | Prometheus | Adventure,Mystery,Sci-Fi | Ridley Scott | 2012 | 124 | 7.0 | 485820 | 126.460 | 65.0 |
| 2 | 2 | 3 | Split | Horror,Thriller | M. Night Shyamalan | 2016 | 117 | 7.3 | 157606 | 138.120 | 62.0 |
| 3 | 3 | 4 | Sing | Animation,Comedy,Family | Christophe Lourdelet | 2016 | 108 | 7.2 | 60545 | 270.320 | 59.0 |
| 4 | 4 | 5 | Suicide Squad | Action,Adventure,Fantasy | David Ayer | 2016 | 123 | 6.2 | 393727 | 325.020 | 40.0 |
| 5 | 5 | 6 | The Great Wall | Action,Adventure,Fantasy | Yimou Zhang | 2016 | 103 | 6.1 | 56036 | 45.130 | 42.0 |
| 6 | 6 | 7 | La La Land | Comedy,Drama,Music | Damien Chazelle | 2016 | 128 | 8.3 | 258682 | 151.060 | 93.0 |
| 7 | 7 | 8 | Mindhorn | Comedy | Sean Foley | 2016 | 89 | 6.4 | 2490 | 47.985 | 71.0 |
| 8 | 8 | 9 | The Lost City of Z | Action,Adventure,Biography | James Gray | 2016 | 141 | 7.1 | 7188 | 8.010 | 78.0 |
| 9 | 9 | 10 | Passengers | Adventure,Drama,Romance | Morten Tyldum | 2016 | 116 | 7.0 | 192177 | 100.010 | 41.0 |
Last rows
| df_index | Rank | Title | Genre | Director | Year | Runtime | Rating | Votes | Revenue | Metascore | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 926 | 988 | 989 | Martyrs | Horror | Pascal Laugier | 2008 | 99 | 7.1 | 63785 | 47.985 | 89.0 |
| 927 | 990 | 991 | Underworld: Rise of the Lycans | Action,Adventure,Fantasy | Patrick Tatopoulos | 2009 | 92 | 6.6 | 129708 | 45.800 | 44.0 |
| 928 | 991 | 992 | Taare Zameen Par | Drama,Family,Music | Aamir Khan | 2007 | 165 | 8.5 | 102697 | 1.200 | 42.0 |
| 929 | 993 | 994 | Resident Evil: Afterlife | Action,Adventure,Horror | Paul W.S. Anderson | 2010 | 97 | 5.9 | 140900 | 60.130 | 37.0 |
| 930 | 994 | 995 | Project X | Comedy | Nima Nourizadeh | 2012 | 88 | 6.7 | 164088 | 54.720 | 48.0 |
| 931 | 995 | 996 | Secret in Their Eyes | Crime,Drama,Mystery | Billy Ray | 2015 | 111 | 6.2 | 27585 | 47.985 | 45.0 |
| 932 | 996 | 997 | Hostel: Part II | Horror | Eli Roth | 2007 | 94 | 5.5 | 73152 | 17.540 | 46.0 |
| 933 | 997 | 998 | Step Up 2: The Streets | Drama,Music,Romance | Jon M. Chu | 2008 | 98 | 6.2 | 70699 | 58.010 | 50.0 |
| 934 | 998 | 999 | Search Party | Adventure,Comedy | Scot Armstrong | 2014 | 93 | 5.6 | 4881 | 47.985 | 22.0 |
| 935 | 999 | 1000 | Nine Lives | Comedy,Family,Fantasy | Barry Sonnenfeld | 2016 | 87 | 5.3 | 12435 | 19.640 | 11.0 |